A Matrix-Type for Performance-Portability

Author

  • N. Peter Drakenberg
Abstract

We describe a datatype for (dense) matrices whose primitive operations are decomposition and composition (of submatrices), as opposed to indexed element access, which is the primitive operation on conventional arrays. Using the composition and decomposition operations it is possible, for example, to express both recursive and traditional block matrix algorithms (e.g., Cholesky factorization, QR factorization, etc.), as illustrated by the following (pretty-printed and contracted) example:

  function cholesky: Matrix → Matrix is
    cholesky( [ a ] ) = [√ a ] ..
    cholesky( [ A B C D ] ) = let U = cholesky( A ) .. Z = zeros( C )
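The abstract's example breaks off after `Z = zeros( C )`, so the remaining steps are not given here. As a rough illustration of the decomposition/composition style, the following Python sketch completes the recursion with the textbook block Cholesky scheme. It assumes that `[ A B C D ]` denotes a 2x2 block partition of a symmetric positive-definite matrix and that the result is the upper-triangular factor. The helper names (`split`, `compose`, `zeros_like`, `solve_ut_transposed`) are hypothetical and are not taken from the paper.

```python
import math

def split(m, k):
    """Decompose m into four blocks [A B; C D], with A of size k x k."""
    a = [row[:k] for row in m[:k]]
    b = [row[k:] for row in m[:k]]
    c = [row[:k] for row in m[k:]]
    d = [row[k:] for row in m[k:]]
    return a, b, c, d

def compose(a, b, c, d):
    """Compose four blocks back into one matrix [A B; C D]."""
    top = [ra + rb for ra, rb in zip(a, b)]
    bottom = [rc + rd for rc, rd in zip(c, d)]
    return top + bottom

def zeros_like(m):
    return [[0.0] * len(m[0]) for _ in m]

def transpose(m):
    return [list(col) for col in zip(*m)]

def matmul(x, y):
    yt = transpose(y)
    return [[sum(p * q for p, q in zip(row, col)) for col in yt] for row in x]

def subtract(x, y):
    return [[p - q for p, q in zip(rx, ry)] for rx, ry in zip(x, y)]

def solve_ut_transposed(u, b):
    """Solve U^T X = B by forward substitution (U upper triangular)."""
    n, cols = len(u), len(b[0])
    x = [[0.0] * cols for _ in range(n)]
    for i in range(n):
        for j in range(cols):
            s = sum(u[k][i] * x[k][j] for k in range(i))
            x[i][j] = (b[i][j] - s) / u[i][i]
    return x

def cholesky(m):
    """Return upper-triangular R with R^T R = m (m symmetric positive definite)."""
    n = len(m)
    if n == 1:
        # Base case from the abstract: cholesky([a]) = [sqrt(a)]
        return [[math.sqrt(m[0][0])]]
    a, b, c, d = split(m, n // 2)
    u = cholesky(a)                 # U = cholesky(A), as in the abstract
    z = zeros_like(c)               # Z = zeros(C), as in the abstract
    # Assumed continuation (standard block scheme, not from the source):
    w = solve_ut_transposed(u, b)   # W solves U^T W = B
    v = cholesky(subtract(d, matmul(transpose(w), w)))  # factor D - W^T W
    return compose(u, w, z, v)
```

Note that the algorithm itself never indexes individual elements of its argument beyond the 1x1 base case; element access is confined to the helpers, which is the separation the abstract describes.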

Similar resources

Dealing with performance/portability and performance/accuracy trade-offs in heterogeneous computing systems: A case study with matrix multiplication modulo primes

We present a study of two important trade-offs in heterogeneous systems (performance versus portability, and performance versus accuracy) for a relevant linear algebra problem, matrix multiplication modulo primes. Integer matrix linear algebra methods rely heavily on matrix multiplication modulo primes. Double precision is necessary for exact representation of sufficiently m...

Stability Analysis of a Matrix Converter Drive: Effects of Input Filter Type and the Voltage Fed to the Modulation Algorithm

Matrix converter instability can cause substantial distortion in the input currents and voltages, which leads to malfunction of the converter. This paper deals with the effects of input filter type, grid inductance, the voltage fed to the modulation algorithm, and the time constant of the synchronous rotating digital filter on the stability and performance of the matrix converter. The studies are ...

Fault Type Estimation in Power Systems

This paper presents a novel approach for fault type estimation in power systems. Fault type estimation is the first step in estimating instantaneous voltage, voltage sag magnitude, and duration in a three-phase system during a fault. The approach is based on time-domain state estimation where redundant measurements are available. The current based model allows a linear mapping between the m...

Multi-threaded Sparse Matrix-Matrix Multiplication for Many-Core and GPU Architectures

Sparse matrix-matrix multiplication is a key kernel that has applications in several domains such as scientific computing and graph analysis. Several algorithms have been studied in the past for this foundational kernel. In this paper, we develop parallel algorithms for sparse matrix-matrix multiplication with a focus on performance portability across different high performance computing archite...

A High Performance, Portable Distributed BLAS Implementation

In this paper, we report on recent developments in the Distributed BLAS (DBLAS) project. These include a powerful distributed matrix representation, which yields a simple interface to the DBLAS, and the redesign of the DBLAS algorithms in terms of powerful `spread' and `reduce' matrix communication operations for reasons of programmability. The DBLAS codes achieve portability by supporting BLACS ...

Writing a performance-portable matrix multiplication

There are several frameworks that, while providing functional portability of code across different platforms, do not automatically provide performance portability. As a consequence, programmers have to hand-tune the kernel codes for each device. The Heterogeneous Programming Library (HPL) is one of these libraries, but it has the interesting feature that the kernel codes, which implement the co...

Publication date: 2004